
Refine all matmul and binary ops #54

Merged: 1 commit merged into intel:master on Jun 22, 2020

Conversation

pinzhenx
Contributor

@pinzhenx pinzhenx commented Jun 5, 2020

Fix #9
Fix #48
Fix #50

@pinzhenx pinzhenx marked this pull request as draft June 5, 2020 06:41
@jiayisunx
Contributor

relevant UT:

    # assumes the surrounding test module's imports, e.g. torch and the
    # ipex package (exact module name assumed):
    #   import torch
    #   import intel_pytorch_extension as ipex
    def test_bmm_(self):
        ipex.core.enable_auto_dnnl()
        ipex.core.enable_mix_bf16_fp32()
        a = torch.randn(8, 12, 128, 64).to('dpcpp')
        b = torch.randn(8, 12, 128, 64).to('dpcpp')
        # batched matmul with the last two dims of b transposed
        attention_scores = torch.matmul(a, b.transpose(-1, -2))

@pinzhenx pinzhenx force-pushed the resize branch 6 times, most recently from 68fb1cf to e24f42d on June 19, 2020 03:27
@pinzhenx pinzhenx marked this pull request as ready for review June 19, 2020 09:02
@pinzhenx
Contributor Author

@EikanWang The PR is ready for review. PTAL.

@pinzhenx pinzhenx changed the title fix *mm ops and support resizing behavior Refine all matmul and binary ops Jun 19, 2020
@pinzhenx
Contributor Author

pinzhenx commented Jun 19, 2020

  • throw exceptions on broadcastable inputs for fallback
    Some matmul and binary ops support broadcasting while dnnl does not, so we check for broadcastable inputs and throw an exception to bypass these cases. Plus, we can now safely remove the workaround code in jit that temporarily disabled dnnl ops during bn folding.
    For topologies that rely heavily on broadcasting, this might cause performance issues; we should later disable some ops on a case-by-case basis or implement broadcast in dnnl. (A sketch of the check is given after this list.)

  • strengthen shape check for matmul
    Our old implementation shared some code between 2d matmul and 3d matmul, which caused some issues. Now they are completely separated. (See the second sketch after this list.)

  • add dil_size
    We query the size of a tensor to check shapes for matmul; dil_size lets this query run without falling back to the cpu path.

  • refine out=... behavior
    All ops with an out=... parameter now discard the old storage entirely and replace it with a dil tensor yielded from the dil ops, regardless of whether the underlying storage is large enough. This differs slightly from pytorch's resizing behavior, where the storage is replaced only if it is smaller than the required output size.
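As a rough illustration of the first bullet, here is a minimal Python sketch of the fallback rule (the real check lives in the C++ op implementations; the helper name is hypothetical):

    import torch

    def assert_no_broadcast(a, b):
        # the dnnl kernels only handle identical input shapes; a shape
        # mismatch implies broadcasting, so raise and let the caller
        # fall back to the native cpu path
        if a.shape != b.shape:
            raise RuntimeError("dnnl does not support broadcast")

    a = torch.randn(8, 12, 128, 64)
    b = torch.randn(1, 12, 128, 64)   # first dim would need broadcasting
    try:
        assert_no_broadcast(a, b)
    except RuntimeError:
        result = a + b                # fallback: stock cpu kernel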

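And a sketch of the separated shape checks from the second bullet, again hypothetical Python standing in for the C++ code:

    import torch

    def check_matmul_shapes(a, b):
        # 2d and 3d matmul are now validated by fully separate paths
        if a.dim() == 2 and b.dim() == 2:
            assert a.size(1) == b.size(0)   # (m, k) @ (k, n)
        elif a.dim() == 3 and b.dim() == 3:
            assert a.size(0) == b.size(0)   # batch dims must match
            assert a.size(2) == b.size(1)   # (b, m, k) @ (b, k, n)
        else:
            raise RuntimeError("unsupported rank combination")

    check_matmul_shapes(torch.randn(4, 5), torch.randn(5, 6))
    check_matmul_shapes(torch.randn(8, 4, 5), torch.randn(8, 5, 6))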
@@ -89,6 +89,15 @@ void reorder_to_desc(const at::Tensor& tensor, const dil::tensor::desc& expected
}

void equip_dil_buffer(const at::Tensor& tensor, dil::tensor dil_tensor_buffer) {
TORCH_INTERNAL_ASSERT_DEBUG_ONLY(
Contributor

Should we use TORCH_CHECK here?

// After equip_dil_buffer(), whole storage should be managed by dil tensor,
// and thus storage metadata should be overwritten by dil tensor
// Note: Storage::set_numel() might be removed later
ipex_tensor_impl->storage().set_numel(dil_tensor_buffer.get_nelems());
Contributor

Does that mean the number of elements of the dil tensor may not be the same as that of the ATen tensor (dilation/padding)?

Contributor Author

We may pass a dummy tensor with zero size as the output and let our ops replace its buffer with one of concrete sizes.
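For illustration, the pattern described above with stock pytorch tensors (the ipex 'dpcpp' device and dil buffer replacement are omitted; plain torch shows the same user-visible behavior):

    import torch

    a = torch.randn(2, 3)
    b = torch.randn(3, 4)
    out = torch.empty(0)           # zero-size dummy output tensor
    torch.matmul(a, b, out=out)    # op replaces the storage with one
    print(out.shape)               # of concrete sizes: torch.Size([2, 4])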

@pinzhenx pinzhenx marked this pull request as draft June 22, 2020 02:31
@pinzhenx pinzhenx marked this pull request as ready for review June 22, 2020 02:52
@EikanWang
Contributor

LGTM

@EikanWang EikanWang self-requested a review June 22, 2020 03:19
@EikanWang EikanWang merged commit 38f9fa7 into intel:master Jun 22, 2020
zhuhaozhe pushed a commit to zhuhaozhe/intel-extension-for-pytorch that referenced this pull request Jun 24, 2020
EikanWang pushed a commit that referenced this pull request Oct 4, 2021
* add ops into blacklist

* clean format

* remove tok from blacklist
Successfully merging this pull request may close these issues:

  • Revisit all *_out ops
  • baddbmm might output random result
  • matmul does not support broadcast
3 participants